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Amendments to the Claims: 

This listing of claims will replace all prior versions, and listings, of claims in the 
application: 

Listing of Claims: 

1 . (Currently amended) A device for automatically identifying the language of a 
digital text, comprising: 

means for prestoring first character strings , including prefixes, suffixes and infixes, of 
different lengths from words of a plurality of predetermined languages, that occur 
frequently anywhere respectively in said w ords of [[a]]said plurality of predetermined 
languages and charactor i zo sa i d prodotorm i nod l anguages , 

means for prestoring second character strings of different lengths, that are 
atypical anywhere respectively in said w ords of said predetermined languages, 

means for analyzing words extracted from said digital text, thereby constructing 
for each extracted word [fallll a plurality of character strings contained in said extracted 
word , including prefixes, suffixes and infixes, with overlap and different and hav i ng 
lengths lying between one character and the number of characters in said extracted 
word, and 

moans for compar i ng charactor str i ngs conta i ned i n oxtractod words to prostorod 
character str i ngs i n ordor to dotorm i no scores assoc i ated w i th sa i d prodotorm i nod 
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means for i nd i v i dua ll y comparing each of [[allll said plurality of character strings 
contained in each said extracted word to said first and second prestored character 
strings of each dotorm i nod l anguago said predetermined languages so that whenever a 
first character string is found in said extracted word, a score associated with said 
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eael^one determined language is increased by a first coefficient depending on tine 
position of said first character string of said one determined language is found in said 
extracted word, and, whenever a second character string is found in said extracted 
word, said score is decreased by a respective second coefficient that is associated with 
said found second character string, said respective second coefficient increasing as the 
probability of said found second character string in said oach one determined language 
decreases, and 

means for comparing said scores for said text associated with said 
predetermined languages in order to determine the highest of said scores, which 
identifies the language of said text. 

2. (Cancelled) 

3. (Original) The device claimed in claim 1, wherein said first coefficient of a 
first character string in said extracted word depends on the frequency of said character 
string in said determined language. 

4. (Original) The device claimed in claim 1, wherein said first coefficient of a 
first character string in said extracted word depends on the length of said character 

string. 

5. (Original) The device claimed in claim 1, wherein said first coefficient of a 
first character string in said extracted word is equal to: 

PO (FR+LON), 

where 
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PO is a coefficient depending on the position of said first character string in said 

extracted word, 

FR is a coefficient depending on the frequency of said first character string in a 
determined language, and 

LON is a coefficient depending on the length of said first character string. 

6. (Previously presented) The device claimed in claim 1, comprising 
comparator means for comparing each of said extracted words from said text with 
frequent words in said determined language and initially listed in storage means so that 
whenever a frequent word is found in said text, said score for said determined language 
is increased only by a coefficient depending on the frequency of said extracted word in 
said determined language. 

7. (Previously presented) The device claimed in claim 1, comprising 
comparator means for comparing each of said extracted words from said text with 
frequent words in said determined language and initially listed in storage means so that 
whenever a frequent word is found in said text, said score for said determined language 
is increased only by a coefficient depending on the length of said frequent word. 

8. (Currently amended) A method of automatically identifying the language of a 
digital text, comprising the steps of: 

prestoring first character strings , including prefixes, suffixes and infixes, of 
different lengths from words of a plurality of predetermined languages, that occur 
frequently anywhere respectively in words of [fall said plurality of predetermined 
languages that charactor i zo sa i d prodotorm i nod l anguages ; 
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prestoring second character strings of different lengths, that are atypical 
anywhere respectively in said w ords of said predetermined languages; 

analyzing words extracted from said digital text, thereby constructing for each 
extracted word rrallll a plurality of character strings contained in said extracted word^ 
including prefixes, suffixes and infixes, with overlap and different and hav i ng lengths 
lying between one character and the number of characters in said extracted word, and 

comparing charactor str i ngs conta i nod i n oxtractod words to prostorod oharactor 
str i ngs i n ordor to dotorm i no scores assoc i ated w i th sa i d prodotorm i nod l anguages, 

i nd i v i dua ll y comparing each of rrallll said plurality of character strings contained in 
each said extracted word to said first and second prestored character strings of eact^ 
dotorm i nod l anauaao said predetermined languages so that whenever a first character 
string is found in said extracted word, a score associated with oach sa i d one determined 
language is increased by a first coefficient depending on the position of said first 
character string found in said extracted word, and, whenever a second character string 
of said one determined language is found in said extracted word, said score is 
decreased by a respective second coefficient that is associated with said found second 
character string, said respective second coefficient increasing as the probability of said 
found second character string in oach said one determined language decreases, and 

comparing said scores for said text associated with said predetermined 
languages in order to determine the highest of said scores, which identifies the 
language of said text. 

9. (Cancelled) 
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10. (Previously presented) The method of claim 8, wherein said first coefficient 
of a first character string in said extracted word is equal to: 

PO (FR+LON), 

where 

PO is a coefficient depending on the position of said first character string in said 

extracted word, 

FR is a coefficient depending on the frequency of said first character string in a 
determined language, and 

LON is a coefficient depending on the length of said first character string. 

11. (cancelled) 

1 2. (new) A device for automatically identifying the language of a digital text, 
comprising: 

means for prestoring first character strings that occur frequently anywhere 
respectively in words of a plurality of predetermined languages and characterize said 
predetermined languages, 

means for prestoring second character strings that are atypical anywhere 
respectively in words of said predetermined languages, 

means for analyzing words extracted from said digital text, thereby constructing 
for each extracted word all character strings contained in said extracted word and 
having lengths lying between one character and the number of characters in said 
extracted word. 
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means for comparing character strings contained in extracted words to prestored 
cliaracter strings in order to determine scores associated with said predetermined 
languages, 

means for individually comparing each of all character strings contained in each 
said extracted word to said first and second prestored character strings of each 
determined language so that whenever a first character string is found in said extracted 
word, a score associated with said each determined language is increased by a first 
coefficient depending on the position of said first character string found in said extracted 
word, and, whenever a second character string is found in said extracted word, said 
score is decreased by a respective second coefficient that is associated with said found 
second character string, said respective second coefficient increasing as the probability 
of said found second character string in said each determined language decreases, and 

means for comparing said scores for said text associated with said 
predetermined languages in order to determine the highest of said scores, which 
identifies the language of said text, wherein said first coefficient of a first character string 
in said extracted word is equal to: 

PO (FR+LON), 

where 

PO is a coefficient depending on the position of said first character string in said 

extracted word, 

FR is a coefficient depending on the frequency of said first character string in a 
determined language, and 

LON is a coefficient depending on the length of said first character string. 
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